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ABSTRACT 



Certain aspects of statistical decision theory are applied to 
assessing objectively the uncertainty involved in estimating which of 
his courses of action (capabilities) an antagonist is pursuing in an 
international conflict of interest situation. Observations are made by 
a nation on its antagonist’s conduct and interpreted as being associated 
with its courses of action, Bayes formula is then used to change a con- 
jectured a priori probability function over the antagonist’s courses of 
action in order to estimate the course of action being pursued by the 
antagonist. A set of decision rules for selecting an appropriate course 
of action to use against an antagonist’s estimated course of action is 
developed^, based on preference orderings and preference quantification. 

The conclusions of this thesis are: 

(1) in spite of uncertainty concerning the course of action being 
pursued by an antagonist, it is possible to select an appropriate 
course of action to be pursued when using decision rules based 

on preference orderings and preference quantification . 

(2) that an international conflict situation may be quantitatively assess- 
ed. 

This thesis was written at the U. S. Naval Postgraduate School, 
Monterey, California, during the period January - May 1960. I am 
indebted to Professors Thomas E. Oberbeck and Franklin F. Sheehan 
for their continued patience, encouragement and most capable guidance 
while acting as faculty advisors. I wish to thank Mrs. Norma Stevens 
for her meticulous preparation of this manuscript. 
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CHAPTER I 



INTRODUCTION 

This thesis concerns itself with the application of certain aspects 
of statistical decision theory to decision making on policy implementa- 
tion at a national level. The specific application made is in the domain 
of foreign policy as it concerns two nations with a conflict of interest 
situation existing between them. 

Since World War IIj, the Free World under the leadership of the 

United States and the Communist Block under the leadership of the Soviet 

Union have been engaged in a struggle to determine whether the nations 

■* 

of the world will have the right to choose their own way of life and 
their own form of government or whether the leaders of the Soviet Union 
will dictate the conditions under which they will live. As yet the out- 
come of this struggle has not yet been decided but the indications are 
that we are losing this struggle, for since the close of World War II 
the Soviet Union has extended its sphere of influence over the major 
portion of the Eurasian land mass and its peoples and has begun to probe 
for handholds in South America and Africa. 

In the many small engagements that have characterized this struggle 
so far the United States has had to make many decisions. In Korea for 
example we were faced with the decisions as to how far north to allow 
our forces to proceed and whether or not to permit our aircraft the 
freedom of hot pursuit north of the Yalu River into Manchuria. Both of 
the resulting decisions made at the time were made in the face of uncertain- 
ty as to what Communist China and the U. S.S.R. would do. Hungary is an- 
other example where we were faced with the prospect of having to make a 



1 



j 



decision in which we were uncertain as to what our antagonist would do. 
The result in this case was that effectively we did nothing. In Cuba 
today the United States again is faced with a decision problem in which 
there is uncertainty as to what Cuba intends to do. 

It is interesting to speculate what course of action the United 
States would have pursued had it known what the intentions of its 
antagonist were in each of the above situations or, more realistically, 
had it utilized a systematic and quantitative process for objectively 
assessing the uncertainty in its estimate as to the intentions of our 
antagonist. The United States, armed with information as to what course 
of action our antagonist was pursuing, might have been able to select 
a course of action, the results of which would have been more advanta- 
geous to this nation and the Free World. The word "might" is used be- 
cause of the fact that assuming such a course of action existed, we 
might have been unwilling to pursue it since it violated our traditional 
standards. For example, a course of action which dictates starting a 
major war against Communist China would violate our tradition of not 
initiating aggressive action against another nation. 

For the foreseeable future these conflict of interest situations 
will continue to arise as part of the overall "protracted conflict" 
characterized by Strausz-Hupe, Kintner, Dougherty and Cottrell in their 
book entitled Protracted Conflict , The United States in this 

atmosphere will be repeatedly confronted with the enigma of making a 
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decision in the face of uncertainty. 

Morton A. Kaplan, Department of Political Science, University of 



discusses and formulates six distinct International Systems based on 
the alignment of nations and relations within this alignment which 
characterize them. In the latter portion of his book he discusses game 
theory and its relation to the "solution" of conflict of interest situa- 
tions in international politics, pointing out its limitations and draw- 
backs. 

Abraham Wald in his text entitled "Statistical Decision Functions" 
4j points out that the statistical decision problem can be interpreted 
as a two-person game. The author in this thesis chooses to view a con- 
flict of interest situation as a statistical decision problem. 

A conflict of interest situation involving the United States and 
the Soviet Union can be considered as a two person game in which the 
United States is a contestant playing against the Soviet Union. Both 
the United States and the Soviet Union have courses of action, designat- 
ed pure strategies in the language of game theory, which they can util- 
ize in the given situation; these courses, which also may be character- 
ized as capabilities of action, are actually a listing by the United 
States of all the possible courses of action which it considers the 
Soviet Union can pursue which are relevant to the particular situation 
being considered, as well as a listing of its own capabilities. The 
context of "course of action" is that the actions of a nation indicate 
that it will utilize the enumerated course of action to gain its 
objective. For example, troop and combat aircraft buildups in the 
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vicinity of Berlin by the Soviet Union might be interpreted by the 
United States that the U. S.S.R, is pursuing a "hot war" course of 
action in the Berlin crisis, while the U.S.S.R. “s permitting United 
States aircraft to use routes outside the prescribed air corridors 
while in flight to Berlin might be interpreted by the United States to 
mean that the U.S.S.R. is pursuing a "cold war" course of action in the 
Berlin crisis. 

On the other hand, the United States might construe the sending of 

its armed forces into Lebanon as its "hot war" course of action while 

the banning of trade with Communist China could be construed as its 

* 

"cold war" course of action. 

Inasmuch as the U.S.S.R. is not going to inform us on the courses 
of action which it is following, the advisors in the State Department 
have the problem of estimating which course of action the U.S.S.R. is 
pursuing in order to provide the President with a basis for making a 
decision on a course of action by the United States to counter that 
of the U.S.S.R. The course of action selected will be chosen relative 
to a decision criterion from the list that the United States has pre- 
pared of its capabilities, but the choice of this course of action will 
depend in part on which estimated course of action the Soviet Union is 
pursuing. 

In the theory of this thesis, the advisors, in order to make their 
estimates, assign probabilities to the courses of action of the U.S.S.R. . 
These probabilities represent the probability with which the United States 
conjectures (estimates) that the U.S.S.R. will use its courses of action 
or mixture of courses of action. A mixture of courses of action is 
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interpreted as employment by the U. S.S.R. of all its courses of action 
or some subset of its courses of action where the frequency with which 
the individual courses of action are to be utilized has been carefully 
predetermined by the U. S.S.R. and the order of their employment is 
determined on a probabilistic basis. The advisors then observe the 
conduct of the U. S.S.R. and interpret specific acts as one of a set 
of possible courses of action of the Soviet Union. The observation 
of the conduct of the Soviet Union may then be used to change the 
assigned probabilities on the Soviet Union's courses of action through 
the use of Bayes theorem which provides the quantative basis for chang- 
ing these probabilities. The a priori probabilities required for the 
use of Bayes theorem are the probabilities assigned by the United States 
to the courses of action of the U, S.S.R. in an initial estimate of the 
situation. 

Bayes theorem and statistical decision theory will help the United 
States to select the course of action it should pursue. Bayes theorem 
improves the assessment of the uncertainty as to which of the conjectured 
courses of action the U. S.S.R. is pursuing by changing the a priori pro- 
babilities of the courses of action. The changed probabilities are then 
known as a posteriori probabilities and these represent the United States 
estimate of the probabilities that the U. S.S.R, is utilizing these partic 
ular courses of action. Statistical decision theory may be employed by 
representatives of the United States to choose objectively an appropri- 
ate course of action. 

The onus of "judgment of values" is placed upon the decision maker 
who utilizes the model, which will be constructed in the next chapter, 
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for formulating his decision, '* Judgment of values** is particularly 
important in determining the preference pattern for the possible out^ 
comes which result from the United States pursuing its courses of 
action and the U,SoS,R. pursuing its courses of action. The decision 
maker must be particularly careful that the resulting preference pat- 
tern reflects the **true** ordering of preferences of the United States 
for the outcomes and is not colored by his personal beliefs. 

The solution of the decision maker's problem rests in the final 
analysis on the selection of the **appropriate course of action** to 
counter the antagonists estimated course of action. The words **appro- 
priate course of action** are synonymous with the selection of a course 
of action which is **optimal according to the criterion established for 
making a decision,** This criterion might be for example the maximiza- 
tion of own national interests, these interests may be construed as 
**utilities** for the possible outcomes which result from the possible 
combinations of courses of action. It may be the minimization of the 
**risks** involved in the courses of action or any criterion which has 
meaning and **value** to the decision maker. In any event the decision 
maker must establish the criterion and make his selection of the **appro- 
priate** course of action on this basis. 

The relationship between the Operations Analysis curriculum and the 
problem analyzed lies in a definition of the purpose of Operations Anal- 
ysis which is **to provide a quantitative basis for executive decision,** 

1 

Paraphrase of a definition of the purpose of Operations Analysis, 
Robert F, Rinehart, Journal of Operations Research Society of 
America ^ 2, 1954. 
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The author proposes that in this context, this thesis falls within 
the province of Operations Analysis. More generally, the author be- 
lieves that every person who calls himself a citizen of a country has 
an obligation to attempt to apply the discipline in which he has been 
trained to problems faced by his country. 
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CHAPTER II 



GENERAL FORMULATION 

Conjecture a situation involving a conflict of interests between 
two nations 5 Nation I and Nation II. Nation I will be the nation making 
a decision in the face of uncertainty. Nation I has a set of two pos- 
sible courses of action (pure strategies) that it can pursue and its 
list of the possible courses of action that it estimates II can pursue 
consists of two courses of action. These courses of action are denoted 
by: 

(a^, a^) for I 
(bj, b^) for II 

Let the courses of action allowed have the following connotation: 

is interpretable as conducive to a non-shooting war or 
cold war 

is interpretable as conducive to a shooting war or hot 
war 

bj is interpretable as conducive to a non-shooting war or 
cold war 

b^ is interpretable as conducive to a shooting war or 
hot war 

The possible situations which can occur may be described as: 

(Sj, b^), (a^, h^), b^), (a^, b^) or 

(C, C), (C, H), (H, C), (H, H) 

Where C represents cold war and H represents hot war. 

Suppose that the order of preference for these situations by Nation 
I is represented as: 

(C, C)>(H, C)>(H, H)>(c, H) 
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with the meaning that the situation to the left of the symbol > 
is preferred (by I) to the situations to the right. 



Subject to the general requirements of utility theory ; (1) 



that preference shall be transitive, i.e., if A^B and B^C then 
A^C; (2) any gamble can be decomposed into its basic alternatives 
according to the rules of probability calculus; and, (3) if there ex- 
ists a gamble in which A is preferred to B and B to C, then there 
shall exist a gamble Involving A and C which is indifferent to B; 
a quantification of these preferences by I can be indicated as follows: 



with the meaning that the "value" of +1 attached to (C, C) is a stand- 
ard of preference for a desirable situation, the "value" -1 attached to 
(H, H) is a standard of preference for an undesirable situation. In the 
example being considered a has a value less than -1 and b has a value 
less than +1 and greater than -1. If all possible preference patterns 
are considered, a and b are arbitrary real numbers, "values", assigned 
to the situations (c, H) and (H, C) respectively relative to the stan- 
dards -fl and -1. 



For example, we might consider a = -2 and b =-|-l/2 giving: 

(C, C)>(H, C)>(H, H)>(C, H) since 
+ l> + l/2>-l>-2 
or if b = +5 and a « -5 
(H, C)>(C, C)>(H, H)>(C, H) since 
+ 5>^l>-l>-5 




(C, C) = +1 

(H, H) = -1 



(C, H) 



a 



(H, C) 



b 



9 



I 




A matrix representation of the conflict situation with associated pre- 
ference values assigned by I would appear as illustrated in Fig. 1 
where a and b are any real numbers 




Conflict Situation Matrix and "Values" assigned by I 

Figure 1 

All the numerical figures in the matrix are assigned by Nation I 
and as such are meaningful to it* They represent a value to I which 
it associates with the possible outcomes of the various combinations 
of courses of action available to I and !!• These numerical values 
are intended to reflect the "utility” to I of each of the possible 
situations^ in the sense described by R. Duncan Luce and Howard Raiffa 
in Games and Decisions ^2 . Nation I must choose an action based on 

its estimate of which course of action or mixtures of courses of action 
II is pursuing* I*s choice will be that course of action or mixture 
of courses of action which gives it the greatest expected return. Assum- 
ing that I has no information or at best limited infoxrmation on the 
course of action II is pursuing, a reasonable procedure for it to follow 
would be to observe II ®s actions and from these observations estimate 
what course of action II is pursuing* The observations represent the 
stochastic process of the statistical decision problem 4j . 

An observation by I of an action by II is represented as the 
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occurrence of one or the other of the pair b^, b^. We shall 
identify this pair as = (b^, b^). The pair b^ and b^ re- 

present the stochastic variables of the statistical decision process 
1^4 . If II employs a mixture of its courses of action, I may identify 

this mixture as represented in Fig. 2 where Y denotes the relative 
frequency (or odds) with which II is using b^, and (1 - Y) the re- 
lative frequency with which II is using \> 2 * Inasmuch as I does not 
know the value of Y, but knows that Y lies in the interval (0, 1) 
we regard Y as a stochastic variable, Y represents the true distri- 
bution, F, associated with one of the independent variables in the 
stochastic process. The values of Y constitute the parameter space, 

, of the statistical decision problem [ 4 ] . We shall consider 
two cases, namely Y is discrete and Y is continuous. 



c 



TI 



Y 



ll-YI 






Odds assigned by I to 
Figure 2 

For the case where Y is discrete, only certain values in the 
interval (0, 1) are allowed; these we shall designate by the notation 

(yn)n » 1, 2, ,N) where N is the number of discrete values of Y 

in the interval (0, 1). When Y is discrete with N values, we have a 
probability function f(y„) such that Pr (Y = y^) = ~ 



n 



n 



IV 

0, ^ P =1. Nation I*s initial a priori estimate is 



n-1 



f(y )o After I observes II courses of action one or more times with 



n 



the 



results^ say, occurring i times and occurring j times, 



the resulting a posteriori distribution which will be designated 
is given by Bayes formula to be; 
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pp.. 



ri 





( 2 . 1 ) 



n, ( 






Assuming stochastic independence of successive observations, the a 
posteriori probability after any set of observations becomes I’s a 
priori estimate for the next observation. Nation I*s initial a priori 
estimate for the discrete case may be designated f^ ^ 

In the discrete case we shall consider several possibilities, namely: 
Case (1) . 



N * 2; Y can assume only two values say, y^ = 0 and 

1 with probabilities and P 2 , respectively, as represented in 

Fig. 3, where the height of the vertical bars represents I’s initial 
a priori estimate. 



N 






! h 



Y — - 

O 1 

showing probabilities that 



Y = y 



n 



Figure 3 

A numerical example of this case is worked out in Chapter III using 
an initial a priori estimate P^^ = ^2 “ 

= 1/4 The general results are that I requires only one observation 

to change its a priori estimate to an a posteriori estimate in which 
the probability one attaches to one or the other of II *s courses of 
action. This is so regardless of the conjectured a priori estimate. 
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Case (2) . 



N s 3; Y can assume only the values, say, y^^ * 0, 
such that 0 -=^y 2 = 1 and y^ * 1 with probabilities Pj^, P^, P^» 
respectively, as represented in Fig, 4 where the height of 
the vertical bars represents I's initial a priori estimate. 




Pz 



P. 



N = 3 showing probabilities that ^ 

Figure 4 

A numerical example of this case is worked out in Chapter III using 
an initial a priori probability f(y^) » 1/3, f(y 2 ) *= 1/6 and 
fCy^) *» 1/2. The general results are that the a priori estimate is 
changed after one observation to an a posteriori probability in which 
non-zero probabilities are assigned to either y^ and y 2 or to y 2 
and y^* As long as the opposite action is not observed in subsequent 
observations, the probability remains distributed over y^ and y 2 or 
over y 2 and y^ with the probability on y^^ or y^ increasing with 
each observation. However^ as soon as an action indicating the opposite 
course of action is observed, the a posteriori probability one attaches 
to y 2 » In this situation all I knows is that II is pursuing a mixture 
of courses of action, i.e. 0 ■= y 2 = 1, but the exact value of y 2 he 
does not know. 

Case (3) . 

N » 4; Y can assume only four values, say, y^^ »* 0, and 

y^ such that 0 < Y 2 ^ y 3 ^ 1 ^4 “ ^ with probabilities Pj^, P 2 , 
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P ^3 P^, respectively, as represented in Fig. 5, where the height of 

the vertical bars represents I's initial a priori estimate. 



Y 






1^ 



^3 






0 1 

N ^ 4, showing probabilities that Y » y 



n 



Figure 5 

A numerical example of this case is worked out in Chapter III using an 
initial a priori probability f(yj^) = 1/4, f(y 2 ) ® l/^» ^(y^) ® 
1/4, and f(y^) = 1/4. The general results are that the a priori 
estimate is changed after one observation to an a posteriori probability 
in which all the probability is distributed over either y^^, Y 2 and 
y^ or over y^, y^ and y^. As long as the opposite action is not 
observed in subsequent observations, the probability remains distributed 
over either y^^, y 2 and y^ or over y^ and y^ with the pro- 

bability on y^^ or y^ increasing with each observation. However, as 
soon as an action indicating the opposite course of action is observed, 
the a posteriori probability assigns non-zero values to the two assumed 
mixtures y^ and y^. In this situation all I need estimate is that 
II's mixture of courses of action lies in a sector either to the left or 
right of a mixture y* lying between y 2 and y^. The side to which 
it lies is determined by the conjectured mixture, y 2 or y^, having 
the greatest a posteriori probability. 

We shall see in the discussion of expected return to I that if the 
+1 



quantity 



b 



lies in the interval (0,1), I need only estimate 



whether Y is greater than or less than 



a + 1 



a + D 



in order to maximize 
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lies outside this interval, 



the expected return to him. If 

-*• t) 



I®s course of action or mixture of courses of action which maximizes 
this expected return is uniquely determined. 

When Y is continuous we consider all possible values in the in- 
terval (0,1) and associate with this interval a probability density 
function, f(y)o Nation I*s initial a priori estimate when Y is 
continuous is f(y). After I observes II* s courses of action one or 
more times with the results, say, bj^ occurring i times and b^ occur- 
ring j times, the resulting a posteriori probability density function 
is given by Bayes formula to be: 



Assuming stochastic independence of successive observations, the a 
posteriori probability density function after any set of observations 
becomes the a priori estimate for the next observation. Nation I*s 
initial a priori estimate for the case where Y is continuous may be 
designated as f^ ^(y) « f(y)* As an example, if I regards all values 
of Y as equally likely, then f(y) is uniform, i.e. f(y) = 1, and 
may be shown as in Fig. 6. 




f(y)y^Cl-yV 



for 0 - y - 1 




o I id zr rc <r -V ! I c r <?. 



0 



%) 



Y 



1 



o 



Uniform density function associated v’ith Y 



Figure 6 
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A numerical example of this case is worked out in Chapter III using 
an initial a priori density function f(y) which is uniform. The gen- 
eral results are that the a priori probability density function is chang- 
ed after each observation and the resulting a posteriori probability den- 
sity function is biased in favor of the value of Y associated with 
II' s course of action, b^^ or b^, observed the greater number of times. 
Up to this point we have been concerned with I's view of the set, 

C , of courses of action, b and b-, which I considers II is employ 
11 1 z 

ing with probabilities Y and (1-Y), respectively. Let us now turn our 
attention to the courses of action (a^^ and a^) available to I. We 
shall say that I has an action space consisting of a^^ and which 

we shall identify as « (a^, a^). Nation I will employ either of 

his courses of action a^^ and a^, or a mixture of his courses of action 
selecting that course of action or mixture of courses of action which is 
"optimal” relevant to his estimate of II' s behavior. In this general 
formulation of the conflict situation and the numerical examples which 
follow we shall use the criterion that I will maximize his national in- 
terests, that is to say, I will maximize the expected (average) "value" 
of the return. We may identify, C^, as represented in Fig. 7, where 
X denotes the relative frequency with which I uses a^ and (1-x) the 
relative frequency with which he uses a 2 > x lies in the interval 
( 0 , 1 ). 



X 



(1-x) 




I's action space showing the relative 
frequency of use of his two courses of action 



Figure 7 
16 



We use a little x in this case since we are not dealing with a 

stochastic variable. The quantity x is equal to — * if a mix- 

a < r -' 

ture of courses of action by I is indicated and 0 or 1 otherwise, 
depending on I's a posteriori estimate of Y. These three values of 
X will be discussed in the discussion on the expected "value" of the 
return to I. 

Our matrix representation of the conflict situation, Fig. 1, 
may now be represented as in Fig. 8. 




Conflict Situation Matrix showing "values" 
assigned by I and Relative Frequencies of 
Cj and 

Figure 8 

The expected (average) value of the return to I is a function of 
the courses of action available to I and II which we shall designate by 



the symbol E K(Cj, 



E (Cj, meaning 



that considering all the possible situations (outcomes) that can occur 
and the relative frequency with which they may occur, the expected 
"value" of the return to I is the sum over all possible situations 
which can occur with the value of each possible situation multiplied 
by the frequency of occurrence of that situation. The term "expected" 
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value of the return to I is not to be interpreted as the value that 
will necessarily obtain for a single occurrence of the conflict situa- 
tions in which I uses a particular course of action against his parti- 
cular estimate of the course of action II is pursuing, but rather may 
be interpreted as a **value” of the return to I due to the uncertainty 
as to which situation will prevail. As an example, consider the con- 
flict situation represented by Fig, 8, its expected value is given by 
equation (2,3)* 



(2.3) E[K(q,Cn)] = lxY + 2.K(l-T) + b(l-x)Y +(-l)(i-x)(l-y) 




b+1 \ 






The quantity E in equation (2.3) is the function that I will 
maximize with respect to x for his estimated value of Y, Fig, 9 
is a pictorial representation over the a, b plane showing the values 



for X for which E [k (C^, 



is maximized for the situation 



represented in Fig, 8. The Fig, 9 results from a detailed analysis 
of the **gains*’ to I which result from all possible combinations of 
values of a and b. From equation (2,3) and Fig. 9 we see that 
the value of x which maximizes E jK (C^, determined not 

from the specific value of Y estimated, but rather the relationship / 



of the estimated Y to the quantity 



. The values, a and b, 



b 

it will be recalled, are the quantities assigned by I to the situations 
(c< H) (H, C) and reflect his preferences for these outcomes of 

the conflict situation. 

Note that if Y = — — ^ , indicated by the vertical dotted 

+ b 
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line in Fig* 9, x can be set equal to any value in the interval 
(O 5 1) and the expected **value** of the return to I is the same for 



all values of x* A corresponding phenomenon occurs when I utilizes a 

b •^1 

mixture of his courses of action such that x = ; — , and the ex- 

pected value of the return to I is the same as when Y = — — — . 

7 \ 

This provides I with a means of coping with a situation which arises 

when an action is required of I prior to his estimation of the course 

b > 1 



of action II is pursuing, for if 



is in the interval ( 0 , 1 ) he 



b -^1 

employs the mixture indicated by x = and the expected value of 

the return to I is the same no matter which course of action II is pur- 
suing* It will be recalled from Chapter I that to use this mixture I 
must select that course of action a^ or a^ by means of some random 
device, such as a spinner with the circumference partitioned in pro- 
portion to the relative frequencies of utilization of a^ and 

Fig* 9 is also a pictorial representation of I*s decision rules 
for the use of his own courses of action, once he has enumerated his 
own preference ordering, for it is noted that a particular preference 
ordering Is associated with only one Area of the a, b plane. For ex- 
ample, in Area I only the following preference orderings are represented 



1 . (H, C)>(C, C)>(C. H)>(H, H) 

2. (H, C)>(C. H)>(C, C)>(H, H) 

3. (H, C)>(C, H)~(C, C)>(H, H) 

4. (H, C) - (C, H)>(C, C)>(H, H) 

5. (C, H)>(H, C) >(C, C)>(H. H) 

where the symbol-^ has the meaning that the situation to the left of the 
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symbol is indifferent (I has equal preference for) to the situation to 
the right. Areas II, III and IV have a similar preference pattern 
lists, but different preference ordering. 

+ 1 

From Fig. 9, we see that except for the special case Y = , 

the action to be taken by I is a^^ or a 2 corresponding to x = 1 or 

X ^ 0 respectively. The decision by I in favor of a^^ or a^ is based 

on the knowledge of the ”true*’ value of Y which Nation I presumably 

does not know. Nation I is therefore confronted with estimating Y, 

and with an estimate of Y he may use Fig. 9 to reach a decision in 

favor of a^ or a^. In the language of statistical decision theory, 

^ t 

I®s estimate of Y represents the terminal decision space, D ; the ter- 
minal decision reached by I is then used in the game theoretic analysis 
summarized in Fig. 9 to reach a point in the action space consisting of 
and a 2 « We will now turn to a consideration of I*s problem of de- 
ciding upon an estimate of Y. The rules which will be developed below 
represent the statistical decision function of statistical decision 
theory. 

Let us first consider the case where Y is discrete. Suppose that 
a sufficient number of observations on II' s courses of 'action have been 
made so that the a posteriori probability f^^j associated with a 

particular discrete value y^ is greater than or equal to some value, 
say .9, which I considers sufficient for him to decide that the particu- 
lar y is indeed the "true" value of Y. Then I uses the value of the 
n 

particular y^ for Y to enter Fig, 9 and comes to a decision in favor 
of or a 2 » This case in the language of statistical decision theory 

is the case of making a point estimate of Y. 
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The decision procedure just described requires that a sufficient 
number of observations have been made of II* s actions so that one of 
the values of Y, say y^, has achieved a **suf f icient** probability^ 
say o9, to lead I to a point estimate of ¥• This procedure will not 
be satisfactory in two situations. The first of these is for the case 
where Y is continuous. The second situation is the case where Y is 
discrete but the requirement for a decision by I to use a^ or a^ 
arises before a sufficient number of observations have been obtained 
to allow I to arrive at a point estimate of Y as described above. 

For these cases an alternate criterion is suggested. It is based on 
the maximization of the expected **value** of the return to I and the 
fact that I need only estimate whether Y is less than or greater 
than in order to select a^^ or a^ so as to maximize the expect- 

ed '"value** of the return to him. In the discrete case I computes the 
expected **value** of the return to him for all y^ if he uses a^^, 
and if he uses a^, and employs his course of action which gives him 
the greater expected return. This expectation is computed with respect 
to the a posteriori probabilities. We shall symbolize this as: 

(2.4) Choose 5 ^ for which ftj ^ 

maximum for k - . 

In the continuous case I follows a similar procedure to that described 
for the discrete case except that (2.4) becomes: 

(2.5) Choose for which J" ^ djy is a 

maximum for . 

Note that this alternate criterion differs from the procedure of making 
a point estimate of Y in that it allows a decision to be made at any 
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time^ ioe, before any observations have been made on II* s courses of 
action or after any number of observations have been made. 



A more sophisticated method I may use in deciding upon an estimate 
Y is a complete analog of the Statistical Decision Problem considered 
by Abraham Wald j^4 in which the cost of experimentation and the cost of 
a wrong decision must be considered after each observation when determin- 
ing whether to act now or take another observation. The analogous com- 
ponents for I to consider in his problem could be the **cost** of observa- 
tion, this **cost** being either real or conjectured, and the **gain** which 
would accrue to him if he took another observation. When the **cost** of 
making another observation is greater than the **gain** to I, he makes his 
estimate of Y. This case requires that **cost** and **gain'* be commensur- 
able with a and b. It will not be considered further in this thesis. 
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CHAPTER III 



NUMERICAL EXAMPLES 

General Situation 

Let us suppose that a conflict of interest situation has arisen 
between Nation I and Nation II wherein let us say Nation II is offer- 
ing to supply arms to the Beta States. Additionally, let us suppose 
that these same Beta States are engaged in open hostilities with their 
neighbor Gamma, Nation I, which has declined to supply arms to either 

side, considers that it is vital to its interests to prevent the supply 

of arms to the Beta nations. In this situation let us consider that 
Nation I has two courses of action available to it, namely: 

(1) attempt to gain its objective without resort to open hostilities. 

For example, this might be through the use of diplomatic chan- 
nels only or the United Nations. This course of action we 

shall call aj^, which is equivalent to a cold war course of 
action. 

(2) attempt to gain its objective by resorting to open hostilities. 
For example, this might be through physically preventing the 
receipt of such shipments by the Beta States, This course 

of action we shall call a^, which is equivalent to a hot war 
course of action. 

Similarly Nation I conjectures that Nation II has two courses of action, 
namely: 

(1) attempt to gain its objective without resort to open hostili- 
ties. This course of action we shall call b^^, which is 
equivalent to a cold war course of action. 
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(2) attempt to gain its objective through resort to open hostili- 
ties. This course of action we shall call b^, which is equi- 
valent to a hot war course of action. 

Having set the stage let us now proceed to examine the various cases 
discussed in the general formulation of the problem. First the case where 
Y is discrete and the three cases enumerated under it. 

Case (1) 

N = 2; Y can assume only two values, and 72*^1 “ 

and y 2 “ ^ 

Assumptions ; 

(1) Nation I's preference ordering: 

(ai> bj^)>>(a2, b^^) >■ (a^, (a^, b^) 

or equivalently 

(C, C)>(H, C)>(H, H)>(C, H) 

(2) Quantification of Nation I*s preference ordering: 

+l>l/2>-l>-2 

(3) Conjectured a priori probability: 



o 

o 


= Pr (Y = yp 


- 3/4; 


Yl = 0 


0 , 0 <>' 2 > 


= Pr (Y = y^) 


= 1/4; 


N> 

It 

1— » 



Let us suppose that the first observation on Nation II* s courses 
of action results in The a posteriori estimate is by Bayes for- 

mula : 
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( 2 . 1 ) 






\/ ; 
^ ^ ' 



yn^a-yn)'-' 



yn^--yr)' 






(3/4 )(1) 



(3/4 )(1) + (l/4)(0) 



r f \ 

^Ql'Ya' 



(l/4)(0) 



(3/4) a) + (1/4 )(0) 



= 0 



We see that after one observation, Nation I is enabled to estimate 
that Y = 0 and that Nation II is pursuing its hot war course of 

action. It is interesting to note that all the a posteriori probability 
would be on for an observation of b^ regardless of the a priori 

probability assumed and on for an observation of b^^. 

Examination of Fig. 9 shows that Nation I*s preference pattern 

lies in Area III. In this area the decision rules for Nation I are: 

^ 1 



(1) Y< 

(2) Y> 



select a«, the hot war course of action, 
b ' 2* 



^ b 
a -^1 



select a^^, the cold war course of action. 



In our case the quantity 



a + b 



= 2/3, and since Nation I has 



estimated that Nation II* s course of action is ^ 2 ^^ “ 

propriate course of action for Nation I is The expected **value** 

of the return to Nation I from equation (2.3) is E | K (C^, 

* 3/2 (x + 1) • (0 - 2/3) + 1-1, which for x = 0 is equal to 
-1, and for x = 1 is equal to -2. From this we also see that 



Nat 



ion I®s course of action a^ maximizes its expected return. 
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Case ( 2 ) 



N = 3 ; Y can assume only three values y^, y2> and y^; 

Yl = 0, y2 where 0 - y2 < 1, and - !• 

Assumptions : 

( 1 ) Nation I*s preference ordering: 

(a^. bpXa^, b^)>(a2. b2> Xa^, b2> 

or equivalently 

(C, C)>(H, C)>(H, H)>(C, H) 

( 2 ) Quantification of Nation I*s preference ordering: 

l>l/2>-l>-2 

( 3 ) Nation I conjectures that Nation II may be employing the 

mixture of its courses of action represented by s= 1/3. 

( 4 ) Conjectured a priori probability: 



o 

o 


= Pr(Y 




= 1 / 3 ; 


^1 


= 0 


0 , 0 <>' 2 ) 


= Pr(Y 


= y 2 > 


= 1 / 6 ; 


^2 


= 1/3 




= Pr(Y 


= 73) 


= 1 / 2 ; 


^3 


= 1 



( 5 ) Predetermined value which the a posteriori probability must 
reach on a course of action or mixture of courses of action before 
Nation I is willing to estimate that Nation II is following that particu 
lar course of action or mixture of courses of action is taken as . 9 . 

Let us suppose that the first observation on Nation II* s courses 
of action results in b2* The a posteriori estimate becomes: 
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(1/3)(1) 

(i/3)(D^(i/6)(e/j; +■ (i/zyo) 



= 3/4 






g/6)(2/3) 

a/3Kl) + (l/6)(2/3)+ (1/^rO) 



1/4 




g/g)(Q) 

(l/3)(l)^(l/6)(?/3)+ (1/2K0) 



We see that from one observation Nation I is able to estimate 
that Nation II is pursuing either its course of action b^ or the 
mixture represented by ■ 1/3. Had the first observation result- 

ed in bj^, Nation I would have been able to estimate that Nation II 
was pursuing either its cours« of action b^ or the mixture represent- 
ed by = 1/3. 

Inasmuch as the a posteriori probability, that associated with 
yj^, is less than .9, Nation I makes another observation. Suppose 
that this observation again results in b 2 * The a posteriori pro- 
bability after this second observation is: 



'o,2<yi) - 

^0.2<>'3’ ' ° 

Since the a posteriori probability associated with y^^ is again less 
than .9, Nation I makes another observation and will continue to make 
observations until one of the a posteriori probabilities reaches .9. 

As long as subsequent observations do not result in b^, the a poster- 
iori probabilities remain distributed over y^^ and y^ with the probabil- 
ity over y^^ increasing after each observation. The total number of 
consecutive observations of b 2 required such that 
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:m0 




is four. Therefore, Nation I, after four observations, makes a 
point estimate of Y = 0 corresponding to II* s course of action 

b 2 * However, as soon as b^^ is observed, the a posteriori probability 
of one attaches to y 2 * For example supposing the third observation 
on the actions of Nation II resulted in b^^, the resulting a posteriori 
probability after this observation would be: 



2<yi> “ ° 

^.2<y3> ' ® 

Application of Bayes theorem will show that as soon as the opposite 
course of action from the one"' originally observed is noted, the a poster- 
iori probability of one attaches itself to y 2 * In this instance we 
see that Nation I estimates that Nation II is pursuing a mixture of its 
courses of action. 

Examination of Fig. 9 shows that Nation I*s preference pattern 
lies in Area III. In this area the decision rules for Nation I are: 

a t1 

(1) Y< ; select a«, the hot war course of action. 

a -»-b ^ 

a 1 

(2) Y> — — g ; select the cold war course of action. 



In our case the quantity 



ci + 1 



= 2/3. Therefore, for the instance 



where Nation I estimates that Nation II is following a course of action 
corresponding to Y = 0, the appropriate course of action for Nation I 

is The expected value of the return to Nation I from equation (2.3) 

for X = 0 is -1 and for x = 1 is -2. From this we also see 
that Nation I*s course of action a^ maximizes its expected return. 

In the instance where Nation I estimates that Nation II is follow- 
ing a mixture of courses of action corresponding to Y = 1/3, the 
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appropriate course of action for Nation I is The expected value 

of the return to Nation I from equation (2.3) for x = 0 is -1/2 
and for x = 1 is -1. From this we also see that Nation I®s course 
of action a^, maximizes its expected return. 
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Case (3) 



N =s 4; Y can assume only 4 values y 2 > Y^> 

^1 “ ^2 ^3 0 ^ y 2 y3 y4 = 

Assumptions : 

(1) Nation I*s preference ordering: 

(a^, bj)>(a2, b2>>>(a2, b^) Xa^, b2> 

or equivalently 

(C, C)>(H, H)>(H, C)>(C, H) 

(2) Quantification of Nation I's preference ordering: 

+ 1> -l>-2>-3 

(3) Nation I conjectures that Nation II may be employing the 

mixtures of its courses of action represented by y 2 = 2/7 and y^ 

= 6/7. 

(4) Predetermined value which the a posteriori probability must 
reach on a course of action or mixture of courses of action before 
Nation I is willing to estimate that Nation II is following that parti- 
cular course of action or mixture of courses of action is taken as .9. 

(5) Conjectured a priori probability: 





= Pr(Y = yp 


= 1/4; 


^1 


= 0 


CM 

>> 

O 

o 


= Pr(Y = y 2 > 


= 1/4; 


^2 


= 2/7 


0,0^^3^ 


= Pr(Y = y^) 


= 1/4; 


^3 


= 6/7 


0,0<>'4> 


= Pr(Y = y^) 


= 1/4; 


^4 


= 1. 



Let us suppose that the first observation on Nation II's courses 
of action results in b^. The a posteriori probability after this first 
observation is: 
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i , V X 

We see that from one observation Nation I is able to estimate that 
Nation II is pursuing either its course of action b^^ or a mixture of 
its courses of action. 

Inasmuch as the greatest a posteriori probability, that associated 
with y^, is less than .9, Nation I makes another observation. Suppose 
that this observation results in b^. The a posteriori probability after 
this second observation is: 



= 5/8 

EjjCyj) = 3/8 

=■ »• 

We see that when both of the courses of action b^^ and have 

been observed, the a posteriori probability assigns zero probabilities 
to y^ = 0 and y^ = 1. However, had the second and all subsequent 
observations resulted in b^^, the a posteriori probabilities would have 
remained distributed over y^> y^ with the maximum probability on 



^ 4 - 

Let us suppose that after two observations Nation I is required to 
implement one of its courses of action and suppose the observations have 
been and Since insufficient observations have been obtained for 

Nation I to arrive at a point estimate of Y on the basis of an a poster - 
iori probability greater than or equal to .9, I uses an alternate pro- 
cedure in order to select the appropriate course of action. This alter- 

.32 , 



\ 



is a 



nate procedure is defined as follows: 

it 

(2.4) choose for which '-flj ''Yp 

miximum for U = 1,2. 

In the example under consideration v;e see that: 



(y J = E fei,yj(0) + (578) ^ L(aiyyg)(3/8) - E (^i,y.,)(0) 



{^-(a-vb) |x- 
f - (a b) (x - 



b '1 

c^ > b 

b il 



/ 3-1 



Y3 



a » b) a < b 



a I D/ a b/ ■ a -» b 

= L-^5a~l/5)[2/r-2/S) -2/5 - 1 ] (5/8) + 



L-^5 (1- l/5)i6/r-2/5 ) ~ 2/5 -l] (3/8) 
- -6 5/58 -^9/56 
-- -1 



and 

^ECa^^y^-, )fi;i(yn)= £(&2,y, ;(0) + F/32,y2)(5/6’) + E(^^^y^)(5/8) -<-£ (a^.y^) 
- -S^/86 

Since Vh^ is greater than Yr.) -^L j (yn ) » 

Nation I employs his course of action a^. 

In this case we see that II' s mij^ture of courses of action having 

the greatest a posteriori probability, y 2 » lies in the sector to the 

a - I 

left of y* and also this same mixture is less than which is 

a * b 

equal to 2/5. Under the simple criterion of selecting a point estimate 
y^ for Y for which the associated a posteriori probability 
is greatest, I would have been lead to select action a 2 * Under the 
criterion of maximizing the expected value of return. Nation I is lead 
to selecting a^. Note, however, that the a posteriori probability on 
y 2 is 5/8, which is less than .9. Had the a posteriori probability on 
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I 



been greater than or equal to .9, Nation I would have also been 
lead to select action a^. 
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Case (4) 



Y is continuous 
Assumptions : 

(1) Nation I's preference ordering: 

(a^, bpXa^, bp>(a^, b^) (a^, b^) 

or equivalently 



(H, C)>(C, C)>(C, H) >(H, H) 

(2) Quantification of Nation I's preference ordering: 

+5 r +l >0>-l 

(3) Conjectured a priori probability density function, 

f(y), uniform, i.e. f(y) = 1 for 0 y ^1, and is 



zero elsewhere. 

Let us suppose that the first observation on Nation II 's courses 
of action results in b^^. The a posteriori probability density func- 
tion is by Bayes formula. 



( 2 . 2 ) 



fi : ( 



j(y) 



n (y ) (y ) '*■ (I- y ) ^ dy 



-f o r O = y = 1 
and z:€='ro elsewhere 



(1- yfdy 

This a posteriori probability density function may be pictured as follows 





A Posteriori Probability Density Function of 
Y after observation of b^ 






for 0 ^ \i ■i'i 
a nd Zt ro <? (5<? w hr 




Figure 10 
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Note that as a result of the single observation, the conjectured 
a priori probability density function is changed to an a posteriori 
probability density function biased in favor of the value of 1. Had 
the observation been the a posteriori probability density function 

would have been biased in favor of zero. 

In that case: 



r-' r 

i ( V/ 



/il) (y3 ’ (i-j/fdy 






for 0-: y ^ 1 

nd Tfro e i5 w n C' re 



and this a posteriori probability density function may be pictured as 
follows: , ^ 







A Posteriori Probability Density Function 
of Y after observation of 

Figure 11 

Let us suppose that Nation I after one observation is required 
to implement one of its courses of action. Nation I uses the alter 
nate procedure to decide which of its courses of action to utilize. 
The alternate procedure for the continuous case is: 

(2.5) Choose a^^ for which ) -fij (y)dy 

0 " 

is a maximum for K -1^^, 



In the example that v;e are considering we see that; 
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- h}-(at-b) 



(. -. 



a ^ b, 



V 




(b+ .;.)(ail) 
ci ' b 




c?y V 



\/ 




5(l-6/5)(y- 1/5) + 6/5 - l]/vdy 




-1 j^ydy 







= ^/3 



and 

^^(<\->:y'^5,o(y)dy = JC'5(0-6/5)(y-!/5)-^6/5 -l]?ydy ^ 5 

Since J 'f )+i,o^y '' *^y greater than 

c o'^ 

Nation I employs his course of action 
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